This data set contains transcribed high-quality audio of random Spanish
sentences recorded by volunteers in Buenos Aires, Argentina. The data set
consists of wave files, and a TSV file (line_index.tsv). The file line_index.tsv
contains a anonymized FileID and the transcription of audio in the file.
<p>
The data set also contains recordings of simple weather messages recorded in
Argentinian Spanish (90 messages), and Peninsular Spanish (90 messages).
<p>
The data set has been manually quality checked, but there might still be errors.
<p>
Please report any issues in the following issue tracker on GitHub.
<a href="https://github.com/googlei18n/language-resources/issues">
  https://github.com/googlei18n/language-resources/issues
</a>
<p>
See LICENSE file for license information.
<p>
Copyright 2018, 2019 Google, Inc.
<p>
If you use this data in publications, please cite it as follows:
<pre>
  @inproceedings{guevara-rukoz-etal-2020-crowdsourcing,
    title = {{Crowdsourcing Latin American Spanish for Low-Resource Text-to-Speech}},
    author = {Guevara-Rukoz, Adriana and Demirsahin, Isin and He, Fei and Chu, Shan-Hui Cathy and Sarin, Supheakmungkol and Pipatsrisawat, Knot and Gutkin, Alexander and Butryna, Alena and Kjartansson, Oddur},
    booktitle = {Proceedings of The 12th Language Resources and Evaluation Conference (LREC)},
    year = {2020},
    month = may,
    address = {Marseille, France},
    publisher = {European Language Resources Association (ELRA)},
    url = {https://www.aclweb.org/anthology/2020.lrec-1.801},
    pages = {6504--6513},
    ISBN = {979-10-95546-34-4},
  }
</pre>
